Robust, Risk-Sensitive, and Data-driven Control
ثبت نشده
چکیده
Markov Decision Processes (MDPs) model problems of sequential decision-nmaking under uncertainty. They have been studied and applied extensively. Nonetheless, there are two major barriers that still hinder the applicability of MDPs to many more practical decision making problems: * The decision maker is often lacking a reliable MDP model. Since the results obtained by dynamic programming are sensitive to the assumed MDP mnodel, their relevance is challenged by model uncertainty. * The structural and computational results of dynamic programming (which deals with expected performance) have been extended with only limited success to accommnodate risk-sensitive decision makers. In this thesis, we investigate two ways of dealing with uncertain MDPs and we develop a new connection between robust control of uncertain MDPs and risk-sensitive control of dynamical systems. The first approach assumes a model of model uncertainty and forimulates the control of uncertain MDPs as a problem of decision-making under (model) uncertainty. We establish that most formulations are at least NP-hard and thus suffer from the "'curse of uncertainty." The worst-case control of MDPs with rectangular uncertainty sets is equivalent to a zero-sum game between the controller and nature. The structural and comnputational results for such games make this formulation appealing. By adding a penalty for unlikely parameters, we extend the formulation of worst-case control of uncertain MDPs and mitigate its conservativeness. We show a duality between the penalized worst-case control of uncertain MDPs with rectangular uncertainty and the mninimization of a Markovian dynamically consistent convex risk measure of the sample cost. This notion of risk has desirable properties for multi-period decision making, including a new Markovian property that we introduce and motivate. This Markovian property is critical in establishing the equivalence between minimizing some risk measure of the sample cost and solving a certain zero-sum Markov game between the decision maker and nature, and to tackling infinite-horizon problems. An alternative approach to dealing with uncertain MDPs, which avoids the curse of uncertainty, is to exploit directly observational data. Specifically, we estimate the expected performance of any given policy (and its gradient with respect to certain policy parameters) from a training set comprising observed trajectories sampled under a known policy. We propose new value (and value gradient) estimators that are unbiased and have low training set to training set variance. We expect our approach to outperform competing approaches when there are few system observations compared to the underlying MDP size, as indicated by numerical experiments. Thesis Supervisor: D. Simester Title: Professor of Management Science Thesis Supervisor: J.N. Tsitsiklis Title: Professor of Electrical Engineering and Computer Science
منابع مشابه
Robust Fractional-order Control of Flexible-Joint Electrically Driven Robots
This paper presents a novel robust fractional PIλ controller design for flexible joint electrically driven robots. Because of using voltage control strategy, the proposed approach is free of problems arising from torque control strategy in the design and implementation. In fact, the motor's current includes the effects of nonlinearities and coupling in the robot manipulator. Therefore, cancella...
متن کاملStability Analysis and Robust PID Control of Cable Driven Robots Considering Elasticity in Cables
In this paper robust PID control of fully-constrained cable driven parallel manipulators with elastic cables is studied in detail. In dynamic analysis, it is assumed that the dominant dynamics of cable can be approximated by linear axial spring. To develop the idea of control for cable robots with elastic cables, a robust PID control for cable driven robots with ideal rigid cables is firstly de...
متن کاملTask-space Control of Electrically Driven Robots
Actuators of robot operate in the joint-space while the end-effect or of robot is controlled in the task-space. Therefore, designing a control system for a robotic system in the task-space requires the jacobian matrix information for transforming joint-space to task-space, which suffers from uncertainties. This paper deals with the robust task-space control of electrically driven robot manipula...
متن کاملRobust Fractional-order Control of Flexible-Joint Electrically Driven Robots
This paper presents a novel robust fractional PIλ controller design for flexible joint electrically driven robots. Because of using voltage control strategy, the proposed approach is free of problems arising from torque control strategy in the design and implementation. In fact, the motor's current includes the effects of nonlinearities and coupling in the robot manipulator. Therefore, cancella...
متن کاملDiscrete time robust control of robot manipulators in the task space using adaptive fuzzy estimator
This paper presents a discrete-time robust control for electrically driven robot manipulators in the task space. A novel discrete-time model-free control law is proposed by employing an adaptive fuzzy estimator for the compensation of the uncertainty including model uncertainty, external disturbances and discretization error. Parameters of the fuzzy estimator are adapted to minimize the estimat...
متن کاملAdaptive RBF network control for robot manipulators
TThe uncertainty estimation and compensation are challenging problems for the robust control of robot manipulators which are complex systems. This paper presents a novel decentralized model-free robust controller for electrically driven robot manipulators. As a novelty, the proposed controller employs a simple Gaussian Radial-Basis-Function Network as an uncertainty estimator. The proposed netw...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007